Game Theory

02/06/2024

Game theory studies strategic interactions
and multiperson decision theory

Defination - Normal form game

A normal form game consists of three things:

$N$ players.
$A_i$ $i \in N$ $a_i\in A_i$
$A = A_1\times A_2 \times \dots \times A_N$
$u_i : A \rightarrow \R$ $\forall i \in N$ .

Example
$N-2$ $A_1 = \{a_1,a_2\}$ $A_2 = \{b_1,b_2\}$
$b_1$ $b_2$
$a_1$ 7,3 3,5
$a_2$ 5,5 7,2
The payoff here is (player 1, player 2)

	$b_1$	$b_2$
$a_1$	7,3	3,5
$a_2$	5,5	7,2

Defination (strictly) dominate

$a \in A_1$ (strictly) dominates $a^\prime \in A_1$ $u_1(a,b) > u_1(a^\prime,b)$ $b\in A_2$

$a$ $a^\prime$ 的收益(严格)更好

Definate weakly dominate

$a \in A_1$ weakly dominates $a^\prime \in A_1$ if

$u_1(a,b) \geq u_1(a^\prime,b)$ $b\in A_2$ ,
$u_1(a,b) > u_1(a^\prime,b)$ $b\in A_2$ ,

In summary, strictly domination implies weak domination, and any two actions cannot weakly dominate each other.

And we have some descriptions:

$a \in A_1$ not $a^\prime \in A_1$ .

$a \in A_1$ strictly $a^\prime \in A_1$ .

$a\in A_1$ weakly dominates $a^\prime \in A_1$ .

$a \in A_1$ $a^\prime \in A_1$ $a^\prime$ $a$ .

Then,

A rational player will never chooses a dominated action, and will always chooses a dominant action.

Example

		Player	2
Player		$L$	$R$
1	$U$	$(3, )$	$(0,)$
	$M$	$(1,)$	$(1,)$
	$D$	$(0,)$	$(3,)$

if we do not know the payoffs for player 2, then we have no dominant and dominated actions so far.

	$L$	$R$	$T$
$U$	$(3,)$	$(1,)$	$(0,)$
$M$	$(3,)$	$(0,)$	$(-1,)$
$D$	$(4,)$	$(-1,)$	$(1,)$

Defination - Beliefs

$\sigma_2 \in \Delta(A_2)$ denotes player 1's beliefs about player 2's actions.

Tip

$A_2$ $\Delta(A_2)$ denotes the probabilities of these strategies

Example

$A_2 = \{a,b,c\}$ ,

$\sigma_2(a) + \sigma_2(b) + \sigma_2(c) = 1$ ,

$u_1(a,\sigma_2)$ $a$ $\sum_{a_2 \in A_2} \sigma_2(a_2) u_1(a,a_2)$ .

$\sigma_2$ , a rational player maximizes expected utility.

Since the set of actions are finite, there is always at least one best action.

Defination - Best response

$a\in A_1$ weak best response $\sigma_2$ $u_1(a,\sigma_2) \geq u_1(a^\prime , \sigma_2)$ $a^\prime \in A_1$ .

Note

The best response correspondence is

\begin{matrix} (1) & B R (σ_{2}) = \arg max_{a \in A_{1}} u_{1} (a, σ_{2}) \end{matrix}

Best response could be multiple.

Tip

Example - Sun Rain Game

	Rain God
Player	Sun	Rain
No Umbrella	$5$	$0$
Umbrella	$1$	$3$

$\sigma = \Pr(\text{Sun})$ $\sigma = \sigma_2(\text{Sun})$ .

$\sigma$ .

$U_1(NU) = 5\times \sigma + 0\times (1-\sigma) = 5\sigma$

$U_1(U) = 1\times \sigma + 3\times(1-\sigma) = 3- 2\sigma$

$5\sigma \geq 3-2\sigma$ $\sigma > \frac{3}{7}$ $NU$ $BR(\sigma_2) = NU$

$U$ $BR(\sigma_2) = U$

$\sigma = \frac37$ $BR(\sigma_2) = \{NU,U\}$ .

Defination - Never a best response

$\forall \sigma_2 \in \Delta(A_2)$ $a \not \in BR(\sigma_2)$ $a$ is never (weak) best response.

Proposition

A strictly dominated action is a NWBR.

Proof
$a$ $a^\prime$ $u(a,b) < u(a^\prime,b)$ $u_1(a,\sigma_2) < u_1(a^\prime , \sigma_2)$ $a$ is a "Never a best response (NWBR)".

Tip

NWBR does not necessarily means being strictly dominated by a pure strategy. It could be dominated by a mixed strategy.

Example

	U	D
L	3	0
M	1	1
R	0	3

There is no strictly dominated action,

$P(U) = \sigma$ $P(D) = (1-\sigma)$ , then we have the expected utility for each action

$u_1(L,\sigma) = 3\sigma$
$u_1(M,\sigma) = 1$
$u_1(R,\sigma) = 3 - 3\sigma$

$L$ $M$ .

$\sigma>\frac13$ $u_1(L,\sigma) > u_1(M,\sigma)$ ,

$\sigma< \frac23$ $u_1(M, \sigma) < u_1(R,\sigma)$ ,

$M$ is a NWBR.

Why? because M is dominated by a mixed strategy.

Mixed action

$\sigma_1 \in \Delta(A_1)$ ,

The support of a mixed action is the set of pure actions given positive probability.

\begin{matrix} (2) & Supp (σ_{1}) = {a \in A_{1} : σ_{1} (a) > 0} \end{matrix}

$\sigma_1$ $\sigma_2 \in \Delta(A_2)$ is

\begin{matrix} (3) & \begin{aligned} u_{1} (σ_{1}, σ_{2}) & = \sum_{a \in A_{1}} σ_{1} (a) u_{1} (a, σ_{2}) \\ = \sum_{a_{1} \in A_{1}} \sum_{a_{2} \in A_{2}} σ_{1} (a_{1}) σ_{2} (a_{2}) u_{1} (a_{1}, a_{2}) \end{aligned} \end{matrix}

$a \in \text{Supp}(\sigma_1)$ , and

\begin{matrix} (4) & \begin{aligned} u_{1} (a, σ_{2}) & \geq u_{1} (a^{'}, σ_{2}) \forall a^{'} \in Supp (σ_{1}) \\ u_{1} (σ_{1}, σ_{2}) & \leq u_{1} (a, σ_{2}) \end{aligned} \end{matrix}

This means that a mixed action cannot yield a payoff higher than the best pure action in its support, since the paoff of the mixture is a convex combination of payoff of the pure action in the support.

Tip

But the mixed action is also something we needed, since it is conditional on the belief, it could be different conditional on other beliefs.

Warning

mixed strategy as best response

Suppose we extend the defination of the best response to mixed actions.

\begin{matrix} (5) & B R (σ_{2}) = \arg max u_{1} (σ_{1}, σ_{2}) σ_{1} \in Δ (A_{1}) \end{matrix}

$\sigma_1$ $\sigma_2$ ,

$a\in \text{Supp}(\sigma_1)$ $\sigma_2$ .

Proof as homework

$\sigma_1$ $\sigma_2$ $a_j \in \text{Supp}(\sigma_1)$ $\sigma_2$ $a_k \in \Delta(A_1) , (a_k \neq a_j)$ is the best respose:

\begin{matrix} (6) & a_{k} = B R (σ_{2}) = \arg max_{a \in A_{1}} u_{1} (a, σ_{2}) \end{matrix}

$u_1(a_k,\sigma_2) > u_1(a_j, \sigma_2)$ .

$\sigma_1$ $\sigma_2$ in this case, because

\begin{matrix} (7) & \begin{aligned} u_{1} (σ_{1}, σ_{2}) & = \sum_{a_{- j} \in Supp (σ_{1})} σ_{1} (a_{- j}) u_{1} (a_{- j}, σ_{2}) + σ_{1} (a_{j}) u_{1} (a_{j}, σ_{2}) \\ < \sum_{a_{- j} \in Supp (σ_{1})} σ_{1} (a_{- j}) u_{1} (a_{- j}, σ_{2}) + σ_{1} (a_{k}) u_{1} (a_{k}, σ_{2}) = u_{1} (σ_{1}^{*}, σ_{2}) \end{aligned} \end{matrix}

$\sigma_1$ is the best response.

$a\in \text{Supp}(\sigma_1)$ $\sigma_2$ .

Note

$a \in A_1$ $\sigma_1 \in \Delta(A_1)$ $u_1(a,b) < u_1(\sigma_1,b)$ $b\in A_2$ .

Proposition

$a \in A_1$ $a$ is strictly dominated by a mixed action.

Proof as homework.

$a\in A_1$ $\Leftrightarrow$ $a$ is strictly dominated by a mixed action.

$\Rightarrow$ part:
$a_j\in A_1$ $\forall \sigma_2 \in \Delta(A_2)$ $a \not \in BR(\sigma_2)$ $a \in A_1$ $u_1(a,\sigma_2)> u_1(a_j,\sigma_2)$ $\forall \sigma_2 \in \Delta(A_2)$ $a_{j}$ $a_j$ $\sigma_2^\prime$ $u_1(a_j,\sigma_2^\prime) \geq u_1(a_{-j},\sigma_2^\prime)$ $a\in A_1$ $\Rightarrow$ $a$ is strictly dominated by a mixed action.
$\Leftarrow$ part:
$a$ $\sigma_1$ $u_1(a,\sigma_2) < u_1(\sigma_1,\sigma_2), \ \forall \sigma_2$ $a \not \in BR(\sigma_2)$ $\forall \sigma_2$ $a \in A_1$ is an NWBR.

Solution concepts

Equilibrium is in undominated strategies. We have already known that a rational player will not play a dominated action, sometimes this fact by itself is enough to give a prediction for the solution of the game
Example
$(D,D)$ is the solution.
Warning
Solution is always a pair instead of a strategy
Example - Sun rain game
There is no strictly dominated action for player 1,
$R$ $S$ $S$ $S$ .
$R$ for player 2.
$(NU,S)$ is the outcome for this game

Notice that we obtain this solution by iteratively deleting the strictly dominated actions.
Let's see what happens if we delete weakly dominated actions
$T$ $B$ $B$ $B$ .

$(T,L)$ $(T,R)$ can be the solution of the game
$R$ $(T,L)$
Warning
Eliminating weakly dominated strategy could be risky of losing solutions, and the final solution depends on the orders of deleting. So we never eliminate weakly dominated strategies.
Consider the following case
P1\P2 D E F
A $(1,1)$ $(1,1)$ $(2,1)$
B $(1,1)$ $(0,0)$ $(3,1)$
C $(1,2)$ $(1,3)$ $(1,1)$
Starting P1
$C$ $A$ $E$ $F$ $A$ is also weakly dominated
$(B,D)$ $(B,F)$ as the solution.
Nash equlibrium
$a^* = (a_1^*,\dots , a_N^*)$ $i$ ,
$\begin{matrix} (8) & u_{i} (a_{i}^{*}, a_{- i}^{*}) \geq u_{i} (a_{i}, a_{- i}^{*}) for every a_{i} \in A_{i} \end{matrix}$
$a_i^* \in BR(a_{-i}^*)$ $i\in N$
$\sigma^* = (\sigma_1^*, \dots, \sigma_N^*)$ $i$ ,
$\begin{matrix} (9) & B R (σ^{*}) \in B R (σ_{- i}^{*}) \end{matrix}$
$a_i \in BR(\sigma^*_{-i})$ $a_i \in \text{Supp}(\sigma_i^*)$ .
Example
Prinsion Dilemma:
Consider the best response,
- $D\in BR(D)$ $D\in BR(D)$ .
  $(D,D)$ is a Nash equilibrium.
  $C$ $C$ $C$ will be dominated as well and becomes a NWBR.
Example - Sun rain game
Consider this example again.
- $BR_1(S) = NU$ $BR_1(R) = U$ .
- $BR_2(NU) = S$ $BR_2(U) = S$ .
- $(NU,S)$ .
Now, what about if we have this payoff?
$(NU,S)$ $(U,S)$ .
And we also have a mixed strategy nash equilibrium.
Proposition
Tip
Any action played positive probability in any NE must survive iterated deletion of strictly dominated actions.
Proof as homework
Suppose there is an action that is played positive probability in a NE

Looking for mixed action equilibrium
Example - Battle of the sexes
$(O,O)$ $(B,B)$ .
$O$ $\sigma_2$ $O$ $\sigma_1$ .
$O$ $B$ which implies conditions on the action that your partner is playing
$O$ $u_1(O, \sigma_2) = 2\sigma_2$ $u_1(B, \sigma_2) = 1-\sigma_2, \Rightarrow \sigma_2 = \frac13$ .
$O$ $B$ to be willing to mix.
$u_2(O,\sigma_1) = \sigma_1, u_2(B,\sigma_1) = 2-2\sigma_1, \Rightarrow \sigma_1 = \frac23$ .
$\left(\sigma_1,\sigma_2 \right) =\left(\frac23,\frac13\right)$ is the mixed strategy nash equilibrium in this example.
,Example - matching pennies
This is a zero sum game:
- $u_1(a, a^\prime ) + u_2(a,a^\prime ) = 0$ $a\in A_1$ $a^\prime \in A_2$ .
In this case, we do not have any best response to best responses. So we do not have any pure strategy nash equilibrium. But this does not imply that we cannot find any mixed strategy equlibrium, we need to check it.
$(\sigma_1, \sigma_2)$ is a mixed action NE.
$Eu_1(H) = 2\sigma_2-1, Eu_1(T) = 1-2\sigma_2, \Rightarrow \sigma_2 = 0.5$ .
$\sigma_1 = 0.5$ .
$(\sigma_1, \sigma_2) = \left(0.5,0.5\right)$ .
Tip
Even if we cannot find a pure strategy nash equilibrium, it is possible to find a mixed strategy.
Typically we will find an odd number of nash equilibriums.
Example - Cournot duopoly
$q_i \in A_i = \R^+$ $Q = q_1 + q_2$ ,
The payoffs are
$\begin{matrix} (10) & u_{i} (q_{i}, q_{j}) = q_{i} P (Q) - c_{i} (q_{i}) \end{matrix}$
$P(Q)$ $Q$ $c_i(q_i)$ is the cost of production.
$P(Q) = a - Q$ $c_i = c(q_i)$ $i$ $a>c$ ,
The best response corresponces are
$\begin{matrix} (11) & B R (q_{j}) = \arg max_{q_{i}} q_{i} (a - q_{i} - q_{j}) - c q_{i} \end{matrix}$
Suppose there exists an interior solution,
FOC:
$\begin{matrix} (12) & a - 2 q_{i} - q_{j} - c = 0 \end{matrix}$
Therefore,
$\begin{matrix} (13) & q_{i} (q_{j}) = \frac{a - q_{j} - c}{2} if a - q_{j} - c > 0 \end{matrix}$
$q_i(q_j) = 0$ otherwise.
Then substitute it back to the b est response
$\begin{matrix} (14) & q_{i}^{*} = \frac{a - c - \frac{a - c - q_{i}^{*}}{2}}{2} \Rightarrow q_{i}^{*} = \frac{a - c}{3} \end{matrix}$
This is a simultaneously move game
Question
1. Suppose that a game has two NE. Suppose one of the NE involves weakly dominated actions and the other doesn't, which one is less likely to be played?
2. Or suppose one of them is Pareto dominated by another NE. which one is less likely to be played?
Let's have an example for illustration,
L R
T $(10,0)$ $(5,2)$
B $(10,11)$ $(2,11)$
$(T,R)$ $(B,L)$
$(B,L)$ $(T,R)$ .
$T$ $R$ $L$ $(B,L)$ is composed of weakly dominated actions.
$(T,R)$ $(B,L)$ , the answer becomes ambiguous.
Tip
Theorem
Any finite game has at least one NE.
Proof as homework
$\Phi$ $\Phi$ $\sigma^{(0)}$ $\sigma^{(n)}=\Phi\left(\sigma^{(n-1)}\right)$ $\Phi$ must be a Nash equilibrium, hence the proof. The space of mixed strategy profiles is clearly compact, since it can be described as:
$\begin{matrix} (15) & {(α_{i}^{(s_{i})}) : \forall i, \sum_{s_{i} \in S_{i}} α_{i}^{(s_{i})} = 1; \forall i, \forall s_{i} \in S_{i}, 0 \leq α_{i}^{(s_{i})} \leq 1} \end{matrix}$
$\alpha=\left(\alpha_i^{\left(s_i\right)}\right)$ $i$ $u_i$ to mixed strategies)
$\begin{matrix} (16) & u_{i} (α) = \sum_{j} \sum_{s_{j} \in S_{j}} α_{j}^{(s_{j})} u^{i} ({(s_{j})}_{j}) . \end{matrix}$
$i$ $s \in S_i$ $\left(\alpha_i^{\left(s_i\right)}\right)_{s_i}$ would be
$\begin{matrix} (17) & u_{i} (s, α_{- i}) = \sum_{j \neq i} \sum_{s_{j} \in S_{j}} α_{j}^{(s_{j})} u^{i} (s, {(s_{j})}_{j \neq i}) . \end{matrix}$
$s \in S_i$ $p_i(s, \alpha)=u_i\left(s, \alpha_{-i}\right)-u_i(\alpha)$ $\Phi$ $i$ $s \in S_i$ $p_i(s)>0$ $\Phi(\alpha)=\alpha^{\prime}$ , with
$\begin{matrix} (18) & α_{i}^{' (s_{i})} = \frac{α_{i}^{(s_{i})} + max (p_{i} (s_{i}, α), 0)}{1 + \sum_{s \in S_{i}} max (p_{i} (s, α), 0)} \end{matrix}$
$\Phi$ is continuous. Finally, it is easy to see that
$\begin{matrix} (19) & \sum_{s : p_{i} (s, α) > 0} α_{i}^{(s)} = \sum_{s : p_{i} (s, α) > 0} \frac{α_{i}^{(s)} + p_{i} (s, α)}{1 + \sum_{s^{'} : p_{i} (s^{'}, α) > 0} p_{i} (s^{'}, α)} \geq \sum_{s : p_{i} (s, α) > 0} α_{i}^{(s)}, \end{matrix}$
$p_i(s, \alpha) \leq 0$ $s$ $\Phi$ $\sum_{s: p_i(s, \alpha)>0} \alpha_i^{\prime(s)}=\sum_{s: p_i(s, \alpha)>0} \alpha_i^{(s)}$ $i$ $p_i(s, \alpha) \leq 0$ $i$ $s \in S_i$ , hence must be a Nash equilibrium. This concludes the proof of the existence of a Nash equilibrium.

JR Theorem 7.2 Proof
$G=\left(S_i, u_i\right)_{i=1}^N$ $n$ $i$ $n$ $S_i=\{1,2, \ldots, n\}$ $u_i\left(j_1, j_2, \ldots, j_N\right)$ $i$ $j_1$ $j_2, \ldots$ $N$ $j_N$ $i$ $M_i=\left\{\left(m_{i 1}, \ldots, m_{i n}\right) \in \mathbb{R}_{+}^n \mid\right.$ $\left.\sum_{j=1}^n m_{i j}=1\right\}$ $m_{i j}$ $i$ $j$ $M_i$ is non-empty, compact, and convex.
$G$ $G$ $G$ .
$f: M \rightarrow M$ $m \in M$ $i$ $j$ , let
$\begin{matrix} (20) & f_{i j} (m) = \frac{m_{i j} + max (0, u_{i} (j, m_{- i}) - u_{i} (m))}{1 + \sum_{j^{'} = 1}^{n} max (0, u_{i} (j^{'}, m_{- i}) - u_{i} (m))} \end{matrix}$
$f_i(m)=\left(f_{i 1}(m), \ldots, f_{i n}(m)\right), i=1, \ldots, N$ $f(m)=\left(f_1(m), \ldots, f_N(m)\right)$ $i, \sum_{j=1}^n f_{i j}(m)=1$ $f_{i j}(m) \geq 0$ $j$ $f_i(m) \in$ $M_i$ $i$ $f(m) \in M$ .
$f_{i j}$ $m$ $m$ $f_{i j}$ $m$ $i$ $j$ $f$ $M$ $f$ $\hat{m}$ .
$f(\hat{m})=\hat{m}$ $f_{i j}(\hat{m})=\hat{m}_{i j}$ $i$ $j$ $f_{i j}$ ,
$\begin{matrix} (21) & {\hat{m}}_{i j} = \frac{{\hat{m}}_{i j} + max (0, u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m}))}{1 + \sum_{j^{'} = 1}^{n} max (0, u_{i} (j^{'}, {\hat{m}}_{- i}) - u_{i} (\hat{m}))} \end{matrix}$
or
$\begin{matrix} (22) & {\hat{m}}_{i j} \sum_{j^{'} = 1}^{n} max (0, u_{i} (j^{'}, {\hat{m}}_{- i}) - u_{i} (\hat{m})) = max (0, u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})) . \end{matrix}$
$u_i\left(j, \hat{m}_{-i}\right)-u_i(\hat{m})$ $j$ gives:
$\begin{matrix} (23) & \begin{matrix} \sum_{j = 1}^{n} {\hat{m}}_{i j} [u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})] \sum_{j^{'} = 1}^{n} max (0, u_{i} (j^{'}, {\hat{m}}_{- i}) - u_{i} (\hat{m})) \\ = \sum_{j = 1}^{n} [u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})] max (0, u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})) \end{matrix} \end{matrix}$
Now, a close look at the left-hand side reveals that it is zero, because
$\begin{matrix} (24) & \begin{aligned} \sum_{j = 1}^{n} {\hat{m}}_{i j} [u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})] & = \sum_{j = 1}^{n} {\hat{m}}_{i j} u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m}) \\ = u_{i} (\hat{m}) - u_{i} (\hat{m}) \\ = 0 \end{aligned} \end{matrix}$
$m_{i j}$ $j$ . Consequently, (P.1) may be rewritten
$\begin{matrix} (25) & 0 = \sum_{j = 1}^{n} [u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})] max (0, u_{i} (j, {\hat{m}}_{- i}) - u_{i} (\hat{m})) . \end{matrix}$
$u_i\left(j, \hat{m}_{-i}\right)-u_i(\hat{m}) \leq 0$ $j$ $u_i\left(j, \hat{m}_{-i}\right)-u_i(\hat{m})>0$ $j$ $j$ $\hat{m}$ is a Nash equilibrium.
Theorem 7.2 is quite remarkable. It says that no matter how many players are involved, as long as each possesses finitely many pure strategies there will be at least one Nash equilibrium. From a practical point of view, this means that the search for a Nash equilibrium will not be futile. More importantly, however, the theorem establishes that the notion of a Nash equilibrium is coherent in a deep way. If Nash equilibria rarely existed, this would indicate a fundamental inconsistency within the definition. That Nash equilibria always exist in finite games is one measure of the soundness of the idea.

P1\P2	D	E	F
A	$(1,1)$	$(1,1)$	$(2,1)$
B	$(1,1)$	$(0,0)$	$(3,1)$
C	$(1,2)$	$(1,3)$	$(1,1)$

	L	R
T	$(10,0)$	$(5,2)$
B	$(10,11)$	$(2,11)$

Extensive form games

Recall the battle of sexes game,

$(O,O)$ $(B,B)$ . This is a simultaneous move game.

$(O,O)$ will be played.

Integrents of extensive form game

$0,1, \dots, N$ $0$ $N$ to denote all individuals, the set of individuals)
$y\in Y = X \cup Z$
- $x \in X$
- $z \in Z$
Directions
predecessors and successors
- $x_0$ ) has one immodiate predecessor.
- Terminal nodes have no successor.
$x_0$ and all its successors.
- A subtree is any node and its successors
Whose decision function
$i: X \rightarrow N$
$i(x)$ $x$
$A(x)$ $i(x)$ $x$ .
Each different act8ion leads to a different successor node.

Information sets

$H$ is a partition of decision node into information sets

A player cannot distinguish between nodes in an information set but can distinguish between information sets.

$h(x)$ $x$
$x^\prime \in h(x),$ $i(x) = i(x^\prime )$
$x^\prime \in h(x)$ $A(x) = A(x^\prime)$ .
Nature's moves: nature moves randomly according to commonly known probability
$u_i: \Z \rightarrow \R$ $\Z$ is terminal nodes) that specify a player's utility payoff at each terminal node

Properties of extensive form game

A game has perfected recall if players don't forget anything during the game
A game has perfect information if each information set contains only one node: that is if playersknow the history of moves so far.
For example in this game we have four information sets, three of them contains more than one nodes, so this game is not a perfect information game. It is simultaneously move game, which means any player does not know the other player's action.
A game has complete information if the structure of the game is common knowledge (each player could draw the game tree).
we will usually transform incomplete information games into imperfect information games by letting Nature choose the structure randomly and unobservably.
Important
Complete info and Perfect info

Strategies and payoffs

In extensive form game, we will distinguish strategies and actions. we don't do that for normal games.

$H_i = \{h\in H, i(h) = i \}$ $i$ $i$ $A_i = \cup_{h\in H_i}A(h)$

pure strategy $i$ $s_i : H_i \rightarrow A_i$ $s_i(h) \in A(h)$ $h\in H_i$

$S_i$ $i$ $S = S_1\times S_2 \times \dots \times S_N$ .

A strategy is a complete contingent plan of actions.

A pure strategy specifies a player's choice of action of each of her information sets, even for sets that are not reached.

Tip

Example: In this game we have 3 information sets for 2 players.

Then we define Mixed strategy ,

$\sigma \in \Delta(S_i)$ $i$ is a probability distribution over pure strategies.

$\left\{\sigma_1 (Tt) = \sigma_1(Tb) = \sigma_1(Bt) = \frac13\right\}$

We know that we can transform extensive form game to normal form game by writting strategies and their corresponding payoffs. Then using this table we can calculate the mixed strategy nash equilibrium.

$S_1/S_2$	$L$	$R$
$Bb$	$(2,3)$	$(2,3)$
$Bt$	$(2,3)$	$(2,3)$
$Tb$	$(1,1)$	$(3,2)$
$Tt$	$(1,1)$	$(0,0)$

A Nash equilibrium of an extensive form game is a NE of the corresponding normal form game.

Refinement of NE

Backward Induction
is the process of analyzing a game from end to begin.
Example
A parent is driving Disneyland with a child in the back seat. The child is making a lot of noise. Parent says "be quiet, or I will turn this car around." This situation is represented in the following extensive form game, where the child's payoffs are given first.
Write down the norm form game:
TT TD DT DD
Q -10,-5 -10,-5 5,10 5,10
N -5,-10 10,5 -5,-10 10,5
There are three pure strategy nash equilibrium here in the norm form game,
$PSNE = \{(N,TD),(Q,DT), (N,DD)\}$ .
$(N,DD)$ .
$(Q,DT)$ $x_3$ (left lower node)? No
So is that a credible threat? No
$x_3$ $x_2$ . Anticipating these decisions, child know that if he chooses quiet he will end up with payoff of 5 and if he choose noise he will end up with payoff of 10.
$(N,DD)$ is the backward induction solution to this game and it's also a NE.
Important
Proposition
Every backward induction solution of an extensive form game is NE.
Every finite perfect information game has at least one backward induction solution in pure strategies. Thus every such game has a PSNE.
But backward induction solutions may involve weakly dominated strategies.
Example
Consider this game:
$S_1 = \{A,B\}$ $S_2 = \{C,D\}$ .
Consider the following normal form game:
C D
A 1,1 1,1
B 1,1 0,0
$(A,C)$ $(B,C)$ $B$ is weakly dominated.
Example
$L$ because it is strictly dominated.
If we change the payoff to this scheme:

We need subgame perfect

Defination
A subgame is a sub tree such that
1. Starts at a decision node and
2. Contains no broken information set
That is if an information set contains a node in the subgame, then every node in that information set is contained in that subgame.
Example:
$x_1$ $x_2$ .

Defination - SPNE
A subgame perfect equilibrium is a profile of strategies where restrictions to any subgame forms a NE of that subgame.
Note
Every subgame perfect equilibrium is a NE (To the whole game), if a game has only one subgame, which is itself, then every NE is a subgame perfect Nash Equilibrium (SPNE).
Tip
In games of perfect information (we do not have any info set that contains multiple nodes, i.e. no simultaneous move game), the set of subgame perfect equilibria and the set of Backward induction are the same
Our backward induction is actually deriving a SPNE.
The advantage of subgame perfection is that it is defined even for games with imperfect information.
TO find a SPNE in an imperfect information game, the procedure is similar to backward induction. we just replace subgames at the end of the game with their equilibrium payoffs, and repeat until you reach the initial node.
Example
There are two subgames in this game.
For the second subgame, the normal form is,
P3 L R
Player 2
T 0,1 1,0
B 1,1 0,0
$(B,L)$ $x_2$ .
$x_2$ $(2,1,1)$ $D$ $(D,B,L)$ .
Next, consider the following exercises: find all the Nash equilibriums in this game (Hint: rewrite this game into a normal form game, we typically need two matrixes)
- $U$ is played by P1,
- L R
  T 1,10,10 ✅ 1,10,10 ✅
  B 1,10,10 1,10,10 ✅
  $D$ is played by P1,
  L R
  T 0,0,1 0,1,0 😂
  B 2,1,1 ✅ 0,0,0
Example
$(ddd,dd)$ . but we do have other Nash equilibriums.

Repeated Games
Let's add a couple of actions to the battle of sexes.
We have two pure strategy nash equilibriums.
$c_1$ $c_2$ .
O B C2
O 2,1 0,0 6,0
B 0,0 1,2 0,0
C1 0,0 0,0 5,5
$c_1$ $c_2$ $(O,O)$ $(B,B)$ $(c_1, c_2)$ $(5,5)$ $O$ .
$G(2)$ , where this game is played twice, First they play the game once, then both players actions are resolved to each other,and they play the game again.
$G(2)$ are the sum of the payoffs in the two stages.
$(c_1, c_2)$ $i$ is a pair of
$\begin{matrix} (26) & s_{i}^{1} \in {O, B, c_{i}} \end{matrix}$
, and in the second stage
$\begin{matrix} (27) & s_{i}^{2} \in {O, B, c_{1}} \times {O, B, c_{2}} \end{matrix}$
They represents your strategy in the first and second stage,
$G(2)$
$\begin{matrix} (28) & \begin{matrix} s_{i}^{1} = c_{i} \\ s_{i}^{2} = {\begin{cases} O if a_{- i} = c_{- i} \\ N o.w. \end{cases} \end{matrix} \end{matrix}$
$(O,O)$ $(B,B)$ $c_1$ $(O,O)$ $(B,B)$ is played.
$5+2 = 7$ $6+1 =7$ , so deviating is not profitable. In the second round (stage), he has no incentive to deviate.
For the woman, she does not has any incentive to deviate. She plays a statistic best response in both periods.
$B,B$ $(O,O)$ are NE of the second stage game.

Now, lets define this game formally.
Defination
$G$ $\delta \in [0,1]$ $G(T,\delta)$ $\delta$ .
$h^t = (a^1, a^2,\dots, a^{t-1})$ $a^s \in A$ $s<t$ $h^t$ $t$ $H^t$ $t$ histories.
$h_1^2 = (a_1)$ $a^1 = (a_0,a_1)$ .
$h^3 = (a^1)$
$H^{T+1}$ is the set of complete histories of the game which correspond to terminal nodes.
$S_i = (s_{i}^1, \dots, s_i^T)$ $S^t_i$ $S_i^t : H^t \rightarrow A_i$ $h^t$ $i$ $s^t_i(h) \in A_i$ $i$ $S_i$ .
$S$ $h^{T+1}(S)$ $a^1 = s^1(h^1)$ $a^1 = s^1(h^1)$ $a^t = s^t(a^1, \dots, a^{t-1})$ $2\leq t\leq T$ .
Like the previous example,
$a^2 = s^2(a^1)$ ,
$s^1_1(h^1) = a_1$ $s_2^1(h^1) = a_2 \Rightarrow a^1 = (a_1, a_2) \rightarrow h_2^2$ .
$a_1^2 = s_1^2(a^1) = s_1^2((a_1, a_2)) = s_1^2(h^2_2)$ .
$\delta \in [0,1]$ is the "discount factor". For simplicity, we will assume that is the some fof all players.
Definations
$i$ $h^{T+1}$ $i$ 's payoffs in each stage/period, Exponentially weighted by the discount factor.
$h^{T+1} = (a^1, a^2, \dots, a^T)$
$\hat u_i(h^{T+1}) = u_i(a^1) + \delta u_i(a^2) + \dots + \delta^{T-1} u_i(a^T)$ .
It will sometimes be convenient to rescale these payoffs so that they are directly comparable do the stage game payoff
$\begin{matrix} (29) & {\hat{u}}_{i} (h^{T + 1}) = \frac{1 - δ}{1 - δ^{T}} (u_{i}) \end{matrix}$
$\delta = 1$ .
$T \rightarrow \infin$ $\hat u_i(h^\infin) = (1-\delta) u_i$ $\hat u_i (h^\infin) = (1-\delta) \sum_{t = 1}^\infin \delta^{t-1} u_i(a^t)$ .
$h^{T+1} = \left(a,a,\dots, a\right)$ .
$\begin{matrix} (30) & {\hat{u}}_{i} (h^{T + 1}) = \frac{1 - δ}{1 - δ^{T}} \sum_{t = 1}^{T - 1} u_{i} (a) = \frac{1 - δ^{T}}{1 - δ} c \end{matrix}$
$u_i(a) = c$
$\hat u_i (h^{T+1}) = u_i(a)???$
Tip
Finitely repeated games
$T$ $T<\infin$ .
C D
C -1,-1 -4,0
D 0,-4 -3,-3
There is a unique NE for stage game.
Here becasue each subgame only have one nash equilibrium, so when we do backward induction, we can degenerate the game tree always with a certain equilibrium. So there will be no room for punishment to deviate from the nash equilibrium at each stage.
$G(T,\delta)$ $(D,D)$ $T$ $T-1$ $(D,D)$ $(D,D)$ will be played in every period.
Proposition
$G$ $a^*$ $T<\infin$ $G(T,\delta)$ $a^*$ in every period.
$t$ , we could specify a subgame equilibrium to be played in that period regardless of history

$a_{-i}$ $i$ $w_i(\alpha_{-i}) = \max u_i(a_i,a_{-i})$ $i$ $\alpha_{-i}$ .
minmax $\nu_i$ $i$ is
$\begin{matrix} (31) & ν_{i} = min_{α_{- i}} w_{i} (α_{- i}) = min_{α_{- i}} max_{α_{i}} u_{i} (a_{i}, a_{- i}) \end{matrix}$
$i$ 's maximum payoff. when he best responds.
$i$ is
$\begin{matrix} (32) & m_{i} = \underset{α_{- i}}{argmin} w_{i} (α_{- i}) \end{matrix}$
Proposition
$T$ $\delta$ $i$ $G(T,\delta)$ $\nu_i$ .
Note
Proof idea
$i$ $\nu_i$ $\nu_i$
Tip
Example
$C$ $D$ $A$ $C$ $\nu_2 = \min_{A,B} \{1,3\} = 1$ $m_2 = A$
$\nu_1$ $C$ $D$ $D$ $\nu_1 = 0$ $m_1 = D$ .
Tip
Example
Think about the mixed strategy.
$m_1$ $\nu_1$ $H$ $p$ $w_1(p) = \max \{2p-1, 1-2p\}$ $p \in [0,1]$ .
- $p\neq 0.5$ $w_1(p) > 0$
- $p<0.5$ $1-2p>0$
- $p>0.5$ $2p-1>0$ .
- $m_1 = \sigma_2(H) = 0.5$ $\nu_1 = 0$
- $m_2 = \sigma_1(H) = 0.5$ $\nu_2 = 0$ .
Infinitely Repeated Games
The principle of optimality (also known as the one shot deviation principle) is most useful when we deal with in finitely repeated games Principle of optimality
$\sigma$ $\sigma$ $\sigma$ again afterwards.
The principle of optimality makes checking for subgame perfection much easier by reducing the number of deviations that we need to check. The idea is that if there is any profitable deviation, then there must be profitable one-shot deviation.
Tip
Example
Strategy:
$S_i^{\prime}=C_i$ $t>1 \quad s_i^t\left(h^t\right)= \begin{cases}c_i & \text { if } a_1^s=C_1, \text{and } a_2^s = C_2 \\ N . & \text { otherwise }\end{cases}$
$(C_1,C_2)$ $(N,N)$ is played forever. There ore two types of histories that we need to check: those when no one has deviated and those after a deviation
$5+ \delta 5+\delta^2 5+\dots = \frac{5}{1-\delta}$ .
$J$ .
$6 + \delta + \delta^2 +\dots = 6 + \frac{\delta}{1-\delta}$ .
$N$ $J$ $0+ \frac{2\delta}{1-\delta}$ .
$\frac{1}{1-\delta}$ . Now any further deviation does not affect future play so again man's best deviation is his best static deviation.
Refinement of SPNE
$(T,R)$ $(B,L)$ .
$L$ $R$ $R$ $L$ . The action/strategy is conditionally strictly dominated.
$(T,R)$ is SPNE, no belief that we will play at there.
We ought to require that at nontrival info sets, players maximized expected utility according to some beliefs about which node they are at.
Tip
Example
There are two subgames.
$(Du,a,L)$ $b$

Perfect Bayesian Equilibrium
$x$ $i$ $h$ $\mu_i(x\mid h)$ $i$ $x$ $h$
Tip
Understanding in this way:
$\begin{matrix} (33) & \begin{aligned} \Pr (h) = \Pr (x_{1}) + \Pr (x_{2}) \\ \Pr (x_{1} ∣ h) = \frac{\Pr (x_{1})}{\Pr (h)} = \frac{\Pr (x_{1})}{\Pr (x_{1}) + \Pr (x_{2})} \\ \Pr (x_{1} ∣ h) + \Pr (x_{2} ∣ h) = 1 \end{aligned} \end{matrix}$
nature $\sigma(\text{including nature's moves})$ Bayes rule pins down conditional belief at info sets that are reached with positive probability.
Defination
[Bleliefbelief $(\sigma, \mu)$ $\mu: X \rightarrow [0,1]$ , gives conditional beliefs of each information set including the trivial ones.
Note
$\mu$ $\Gamma_E$ $\mu(x) \in[0,1]$ $x$ $\Gamma_E$ $\sum_{x \in H} \mu(x)=1$ $H$ .
- A system of beliefs specify, for each information set, a probabilistic assessment by the player who moves there, conditional upon play having reached that information set.
- $E\left[u_i \mid H, \mu, \sigma_i, \sigma_{-i}\right]$ $H$ $\mu$ $\sigma_i$ $\sigma_{-i}$ .
Sequentially rational $(\sigma, \nu)$ sequentially rational $\sigma_i$ $\mu$ $i$ $i$ 's information sets
$\sigma$ $\mu$
Note
A formal statement
$\sigma$ $\Gamma_E$ $H$ $\mu$ $\iota(H)$ $H$ , we have
$\begin{matrix} (34) & E [u_{ι (H)} ∣ H, μ, σ_{ι (H)}, σ_{- ι (H)}] \geq E [u_{ι (H)} ∣ H, μ, {\tilde{σ}}_{ι (H)}, σ_{- ι (H)}] \end{matrix}$
$\tilde{\sigma}_{\iota(H)} \in \Delta\left(S_{\iota(H)}\right)$ $\sigma$ $H$ $\sigma$ is sequentially rational given belief system.
$(\sigma, \mu)$ is a Perfect Bayesian equilibrium ((PBE)), if
- it is sequential rational , and
- $\mu$ $\sigma$ "Whenever possible"
Note
Proposition
$(\sigma, \mu)$ $\sigma$ is SPNE,
Proof by intuition
$\sigma$ $\sigma$ $\sigma^\prime$ $H$ $\sigma$ $H$ .
$\iota(H)$ $H$ $u_{\iota(H)}(\sigma^\prime) > u_{\iota(H)}(\sigma)$ .
$\mu$ , we have
$\begin{matrix} (35) & E [u_{ι (H)} ∣ H, μ, σ_{ι (H)}^{'}, σ_{- ι (H)}] \geq E [u_{ι (H)} ∣ H, μ, σ_{ι (H)}, σ_{- ι (H)}] \end{matrix}$
$\sigma$ $(\sigma,\mu)$ cannot be the PBE.
$(\sigma, \mu)$ $\sigma$ is SPNE.

Tip
Example
In order to find PBE, we know it must be a subset of SPNE, so we can first find the set of SPNE.
In this example, we only find one SPNE, so it must be part of PBE.
$D$ $B$ , so player 3's belief is updated.
$(D,B,R)$ $\mu_3(x_3\mid h) = 0$ $\mu_3(x_4\mid h) = 1$
Another example,
$(T,R)$ $(B,L)$ are the SPNE,
$(T,R)$ is not a part of PBE, why?
$B$ $\mu_3(x_3 \mid h) = 1$ .
$\begin{matrix} (36) & ((B, L), μ_{2} (x_{2} ∣ h) = 0) \to P B E \end{matrix}$
Tip
Example
$In$ $Out$ $L$ $R$ .
We only have one subgame,
$L,R$ $0.5$ . so the expected utility is then,
$(Out, a)$ $(In, b)$
They are SPE,
$\mu(x_1 = 0.5), \mu(x_3 = 0.5)$
Then,,
$\begin{matrix} (37) & ((I n, b), μ (x_{1}) = 0.5, μ (x_{3}) = 0.5) \to P B E \end{matrix}$
Homework

	TT	TD	DT	DD
Q	-10,-5	-10,-5	5,10	5,10
N	-5,-10	10,5	-5,-10	10,5

	C	D
A	1,1	1,1
B	1,1	0,0

	L	R
Player 2
T	0,1	1,0
B	1,1	0,0

	L	R
T	1,10,10 ✅	1,10,10 ✅
B	1,10,10	1,10,10 ✅

	L	R
T	0,0,1	0,1,0 😂
B	2,1,1 ✅	0,0,0

	O	B	C2
O	2,1	0,0	6,0
B	0,0	1,2	0,0
C1	0,0	0,0	5,5

	C	D
C	-1,-1	-4,0
D	0,-4	-3,-3

Auction

Consider the following sealed -bid first price auction,

Sealed-bid, no knowledge about the other participants
first price: the bid with highest price wins.
single object
$\{0,1/3\}$
two risk neutral bidders (maximizes expected payoff)
$i$ $\nu_i$ ) is a private information
$\nu_i$ $U[0,1]$ . this distribution is known.
$0$ $1/3$ .
higher bidder wins and pays her bid
ties broken by fair coin.

[Solution]

$t^* \in [0,1]$ , such that

\begin{matrix} (38) & \begin{matrix} s_{i}^{*} (t_{i}) = 0 if t_{i} \in [0, t^{*}] \\ s_{i}^{*} (t_{i}) = \frac{1}{3} if t_{i} \in [t^{*}, 1] \end{matrix} \end{matrix}

$t^*$ $i$ $0$ $1/3$ .

$s_2^*()$ ,

$\pi_1(\frac13, s_2^*(t_2), t_1)$ $t_1$ $t_2$ $s_2^*(t_2)$ $1/3$ ,

$E\left[\pi_1(\frac13, s_2^*(t_2), t_1)\right] = \Pr(t_2 > t_2^*)\times \frac12 \times (t_1-\frac13) + \Pr(t_2<t_2^*) \times (t_1 - \frac13) = \frac12 t_2^* (t_1 - \frac13) - \frac12 t_2^* \frac13$ .

$E\left[\pi_1(0, s_2^*(t_2), t_1)\right] = \Pr(t_2 > t_2^*)\times 0 + \Pr(t_2<t_2^*) \times \frac12 \times (t_1 -0) = \frac{t_1 t_2^*}{2}$ .

$\frac13$ $0$ if the expected payoff for player 1 of two different options are the same.

\begin{matrix} (39) & t_{1} = \frac{1}{3} + \frac{1}{3} t_{2}^{*} \end{matrix}

$\frac13$ $0$ if

\begin{matrix} (40) & t_{2} = \frac{1}{3} + \frac{1}{3} t_{1}^{*} \end{matrix}

Defination

$T_1$ $T_2$ $(s_1^*, s_2^*)$ BNE $t_i$ $s_i^*(t_i)$ solves

\begin{matrix} (41) & max_{a_{i} \in A_{i}, t_{j} \in T_{j}} u_{i} (a_{i}, s_{- i}^{*} (t_{j}), t_{j}) \underset{i}{Pr} (t_{j} ∣ t_{i}) \end{matrix}

$\Pr_i(t_j \mid t_i)$ $i$ $j$ $t_j$ $i$ $t_i$ .

Standard Auction Formats

$i$ $b_i$ $i$ $b_i$ .
Second price auction: highest bidder wins and pays the second highest bid.
English auction: Bidder dynamically submit successively higher bids. Final bidder wins and pays her final bid.
Dutch auction: Auctioner starts at a high price, successively announces lower price until someone bids. The lowerst bidder wins and pays the current price;

Homework

Intuitively explain the difference and simularities between Dutch, English and first price.

[Solution] of the sealed bid second price auction

$\hat b_{-i} = \max_{j\neq i} b_j$ .

Consider the following cases,

	$i$ $b_i^\prime < v_i$	$i$ $b_i^* = v_i$	$i$ $b_i^{\prime \prime } > v_i$
$\hat b_{-i} \leq b_i^\prime$	$v_i - \hat b_{-i}$	$v_i - \hat b_{-i}$	$v_i - \hat b_{-i}$
$b_i^\prime < \hat b_{-i} \leq v_i$	$0$	$v_i - \hat b_{-i}$	$v_i - \hat b_{-i}$
$v_i < \hat b_{-i} \leq b_i^{\prime \prime }$	$0$	$0$	$v_i - \hat b_{-i}<0$
$\hat b_{-i} > b_i^{\prime \prime}$	$0$	$0$	$0$

$i$ $b_i^* = v_i$ is always the weakly dominant strategy.

$v_1$ $v_2$ for 2. They are i.i.d.

$v_1 \sim U(0,1)$ $v_2 \sim U(0,1)$ .

Consider the first price sealed bid auction. Buyers strategies depends on their valuation.

$B_i(v_i) = \hat v_i$ $i$ $v_i$ $B$ $v$ $w$ is your opponent value.

A BNE is a pair of strategies such that

\begin{matrix} (42) & max (v - \hat{v}) Pr (\hat{v} > B (w)) + \frac{1}{2} (v - \hat{v}) \underset{0}{\underset{⏟}{Pr (\hat{v} = B (w))}} \end{matrix}

$\exist B^{-1} = \phi$ $B(v) = \hat v \Rightarrow \phi(\hat v) = v$ .

Then,

\begin{matrix} (43) & Pr (\hat{v} > B (w)) = Pr (ϕ (\hat{v}) > w) = ϕ (\hat{v}) \end{matrix}

Then the maximization problem becomes,

\begin{matrix} (44) & max_{\hat{v}} (v - \hat{v}) ϕ (\hat{v}) \end{matrix}

$- \phi(\hat v) + (v - \hat v) \phi^\prime (\hat v) = 0$ $B(v) = \hat v$ .

\begin{matrix} (45) & \begin{aligned} ϕ^{'} (\hat{v}) & = \frac{\partial v}{\partial \hat{v}} \\ v = ϕ (\hat{v}) & = (v - \hat{v}) ϕ^{'} (\hat{v}) \end{aligned} \end{matrix}

$v = (v - \hat v) \frac{\partial \hat v}{\partial v} \Rightarrow v = v\frac{\partial \hat v}{\partial v} + \hat v \Rightarrow v = \frac{\partial \hat v v}{\partial v}$

$\frac12 v^2 = v \hat v + c$ $c = 0$ .

$\hat v = \frac12 v$ .

$\frac12 v \Pr(v>w) = \frac12 v^2$ .

$2 \int_0^1 \frac12 v^2 dv = \frac13$ .

Homework:
Find the expected payment of winner, expected gain of winner, and expected seller gain under second price sealed bid auction.
$b_i = v_i$ $v_i \sim U(0,1)$
$v,w$ as two players' valuation.
expected payment of winner
$\begin{matrix} (46) & \begin{aligned} E [B (v)] & = \int_{0}^{v} w d w + \frac{1}{2} \underset{0}{\underset{⏟}{Pr (w = v)}} w \\ = \frac{1}{2} v^{2} \end{aligned} \end{matrix}$
Expected gain of winner
$\begin{matrix} (47) & E [B (v) - w] = \int_{0}^{v} v - w d w = \frac{1}{2} v^{2} \end{matrix}$
Expected seller gain
$\begin{matrix} (48) & 2 \int_{0}^{1} \frac{1}{2} v^{2} d v = \frac{1}{3} \end{matrix}$

Labor market signaling

$\theta$ $\theta_H$ $\lambda$ $\theta_L$ $1-\lambda$ $\theta_H > \theta_L> 0$ $e \geq 0$ .

$e$ $\theta$ $\theta$ $e$ $v(e, \theta ) = \theta$ . this implies that education is just a signal and does not add value.

$e$ $\theta$ $c(e,\theta)$ .

$c(e, \theta) : c_e>0, c_{ee}> 0$ $c(0,\theta) = 0$ $c_\theta<0$ $c_{e\theta}<0$ .

$\theta$ $e$ $w$ is

\begin{matrix} (49) & u (e, w ∣ θ) = w - c (e, θ) \end{matrix}

The profit to the firm is

\begin{matrix} (50) & v (e, θ) = v (e, θ) - w = θ - w \end{matrix}

We look for symmetric PBE in pure strategies, That is we look for 3 functions,

$w^*(e)$ is wage paid by firm as a function of observed education level
$e^*(\theta)$ is education level chosen by worker as a function of their ability type
$\mu^*(e) = \Pr(\theta = \theta_H \mid e)$ $e$ is a high type.

To be PBE, the functions must satisfy the following conditions,

$w^*(e)= \mu^*(e) \theta_H + (1- \mu^*(e)) \theta_L$ $e \geq 0$ . This is the zero profit condition.
$e^*(\theta) \in \arg \max w^*(e) - c(e,\theta)$ $\theta$ . This is the best response condition.
$\mu ^* (e)$ $e\in \{e^*(\theta_L), e^*(\theta_H)\}$ .

$e_L = e^*(\theta_L), e_H = e^*(\theta_H), w_L = w^*(e_L), w_H = w^*(e_H)$ .

There are two types of PBE,

$e_H = e_L$ . Two types of workers choose same level of education.
$e_H \neq e_L$ . Two types of workers choose different level of education.

Let's discuss the separating equilibrium first

Tip

$w^*\left(e^*\left(\theta_H\right)\right)=\theta_H$ $w^*\left(e^*\left(\theta_L\right)\right)=\theta_L$ .

$e_L^*$ $\theta_L$ $w^*(e^*(\theta_L)) = \theta_L$ . Otherwise the firm will suffer from negative profit.

$e^*_L$ $\theta_H$

$e^*_L = 0$ . Otherwise they will have incentive to deviate.

Game Theory

Mixed action

Solution concepts

Extensive form games

Repeated Games

Perfect Bayesian Equilibrium

Auction

Labor market signaling